Rebalancing Docked Bicycle Sharing System with Approximate Dynamic Programming and Reinforcement Learning

نویسندگان

چکیده

The bicycle, an active transportation mode, has received increasing attention as alternative in urban environments worldwide. However, effectively managing the stock levels of rental bicycles at each station is challenging demand vary with time, particularly when users are allowed to return any station. There a need for system-wide management bicycle by transporting available from one another. In this study, rebalancing model based on Markov decision process (MDP) developed using real-time dynamic programming method and reinforcement learning considering system characteristics. pickup demands stochastic continuously changing. As result, proposed framework suggests best operation option every 10 min realized variables future predicted random forest method, minimizing expected unmet demand. Moreover, we adopt custom prioritizing strategies reduce number action candidates operator computational complexity practicality MDP framework. Numerical experiments demonstrate that outperforms existing methods, such short-term static lookahead policies. Among suggested strategies, focusing stations larger error prediction was found be most effective. Additionally, effects various safety buffers were examined.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Approximate Dynamic Programming and Reinforcement Learning

Dynamic programming (DP) and reinforcement learning (RL) can be used to address problems from a variety of fields, including automatic control, artificial intelligence, operations research, and economy. Many problems in these fields are described by continuous variables, whereas DP and RL can find exact solutions only in the discrete case. Therefore, approximation is essential in practical DP a...

متن کامل

Reinforcement Learning And Approximate Dynamic Programming For Feedback Control

feedback control of dynamic systems 6th solution PDF feedback control of dynamic systems 6th solutions PDF feedback control of dynamic systems 5th edition pdf PDF feedback control of dynamic systems solution PDF feedback control of dynamic systems 7th edition PDF feedback control of dynamic systems 6th edition PDF feedback control of dynamic systems solutions PDF feedback control of dynamic sys...

متن کامل

Editorial: Special Section on Reinforcement Learning and Approximate Dynamic Programming

Approximate dynamic programming (ADP) is to compute near-optimal solutions to Markov decision problems (MDPs) with large or continuous spaces. In recent years, the research works on ADP have been brought together with the reinforcement learning (RL) community [1-4]. RL is a machine learning framework for solving sequential decision making problems that can also be modeled as the MDP formalism. ...

متن کامل

A Public Bicycle Sharing System Considering Renting and Middle Stations

Recently, public bicycle sharing system (PBSS) has become one of the most favorite urban transportation systems that can help governments to decrease environmental problems such as pollution and traffic. This paper studies a sharing system that includes two types of stations. The first category contains stations that users can rent or return back bicycles and each bicycle can be rented by any n...

متن کامل

Optimal Learning and Approximate Dynamic Programming

Approximate dynamic programming (ADP) has emerged as a powerful tool for tackling a diverse collection of stochastic optimization problems. Reflecting the wide diversity of problems, ADP (including research under names such as reinforcement learning, adaptive dynamic programming and neuro-dynamic programming) has become an umbrella for a wide range of algorithmic strategies. Most of these invol...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Advanced Transportation

سال: 2022

ISSN: ['0197-6729', '2042-3195']

DOI: https://doi.org/10.1155/2022/2780711